On Segment-Based Stream Modeling and Its Applications

نویسنده

  • Charu C. Aggarwal
چکیده

The primary constraint in the effective mining of data streams is the large volume of data which must be processed in real time. In many cases, it is desirable to store a summary of the data stream segments in order to perform data mining tasks. Since density estimation provides a comprehensive overview of the probabilistic data distribution of a stream segment, it is a natural choice for this purpose. A direct use of density distributions can however turn out to be an inefficient storage and processing mechanism in practice. In this paper, we introduce the concept of cluster histograms, which provides an efficient way to estimate and summarize the most important data distribution profiles over different stream segments. These profiles can be constructed in a supervised or unsupervised way depending upon the nature of the underlying application. The profiles can also be used for change detection, anomaly detection, segmental nearest neighbor search, or supervised stream segment classification. The flexibility of the tasks which can be performed from the cluster histogram framework follows from its generality in storing the historical density profile of the data stream. As a result, this method provides a holistic framework for density based mining of data streams. We discuss and test the application of the cluster histogram framework to a variety of interesting data mining applications such as speaker recognition and intrusion detection.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An integrated framework for knowledge - based modeling and simulation of natural systems

This paper proposes a new approach to simulation modeling of natural systems in the context of water quality modeling in streams affected by point source pollution. The approach has a potential for application to other domains of natural resource modeling. Its conceptual basis is knowledge-based simulation and systems analysis. In the approach presented in this paper, a stream or its section is...

متن کامل

The Influence of DC-Link Voltage on Commutation Torque Ripple of Brushless DC Motors with Two-Segment Pulse-width Modulation Control Method

The commutation process causes current ripple to be generated in the drive system of brushless DC (BLDC) motor. This, in turn, leads to output torque ripple. Mechanical vibration and acoustic noise are its influences which are undesirable phenomenon in some applications. A new method is presented in this paper which reduces torque ripple and commutation period in the entire range of motor speed...

متن کامل

Separation of Geochemical Anomalies Using Factor Analysis and Concentration-Number (C-N) Fractal Modeling Based on Stream Sediments Data in Esfordi 1:100000 Sheet, Central Iran

The aim of this study is separation of Fe2O3, TiO2 and V2O5 anomalies in Esfordi 1:100,000 sheet which is located in Bafq district, Central Iran. The analyzed elements of stream sediment samples taken in the area can be classified into 5 groups (factors) by factor analysis. The Concentration–Number (C-N) fractal model was used for delineation of the Fe2O3, TiO2 and V2O5 thresholds. According to...

متن کامل

A review of agent-based modeling (ABM) concepts and some of its main applications in management science

We live in a very complex world where we face complex phenomena such as social norms and new technologies. To deal with such phenomena, social scientists often use reductionism approach where they reduce them to some lower-lever variables and model the relationships among them through a scheme of equations. This approach that is called equation based modeling (EBM) has some basic weaknesses in ...

متن کامل

Design and Modeling of a New Type of Tactile Sensor Based on the Deformation of an Elastic Membrane

This paper presents the design and modeling of a flexible tactile sensor, capable of detecting the 2D surface texture image, contact-force estimation and stiffness of the sensed object. The sensor is made of polymer materials. It consists of a cylindrical chamber for pneumatic actuation and a membrane with a mesa structure. The inner radius of the cylindrical chamber is 2cm and its outer radius...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009